Computationally Effic Modification of Speech Us

نویسندگان

  • Sung-Joo Lee
  • Hyung So
چکیده

Among the conventional time-scale modification methods [1][6], the synchronized overlap and add (SOLA) method [4] is used widely because of its good performance with relatively low computational complexity. But the SOLA method still requires much computation in evaluating the normalized crosscorrelation function for synchronization procedure [9]. In this paper, we employ 3 level center clipping method in order to reduce the computational complexity of SOLA method. The result of subjective preference test indicates that the proposed method can reduce computational complexity by over 80% comparing with the conventional SOLA method without considerable performance degradation. We also apply the variable time-scale modification method using transient information [7] to the proposed algorithm. By doing so, we can maintain the intelligibility of time-scale modified speech in the case of very fast playback.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Real Time Prosody Modification

Real time prosody modification involves changing the prosody parameters such as pitch, duration and intensity of speech in real time without affecting the intelligibility and naturalness. In this paper prosody modification is performed using instants of significant excitation (ISE) of the vocal tract system during production of speech. In the conventional prosody modification system the ISE are...

متن کامل

A Unix-based Speech Data Collection Platform

It is highly desirable to collect speech data from the telephone network via a digital interface. This avoids an additional A/D conversion normally required by analog telephone data collection hardware. A popular solution to this problem is the use of a T1 line which offers 24 digital phone lines. The leading T1 interface for Sun workstations is a system developed by Linkon Corporation. Using t...

متن کامل

An Overlap-add Technique Based on Waveform Similarity (wsola) for High Quality Time-scale Modification of Speech

A concept of waveform similarity is proposed for tackling the problem of time-scale modification of speech, and is worked-out in the context of short-time Fourier transform representations. The resulting WSOLA algorithm produces high quality speech output, is algorithmically and computationally efficient and robust, and allows for on-line processing with arbitrary timescaling factors that may b...

متن کامل

Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals

Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...

متن کامل

Overlap-add methods for time-scaling of speech

In this tutorial on time scaling we follow one particular line of thought towards computationally efficient high quality methods. We favor time scaling based on time-frequency representations over model based approaches, and proceed to review an iterative phase reconstruction method for time-scaled magnitude spectrograms. The search for a good initial phase estimate leads us to consider synchro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002